Semi-Supervised Fuzzy Clustering with Feature Discrimination
نویسندگان
چکیده
Semi-supervised clustering algorithms are increasingly employed for discovering hidden structure in data with partially labelled patterns. In order to make the clustering approach useful and acceptable to users, the information provided must be simple, natural and limited in number. To improve recognition capability, we apply an effective feature enhancement procedure to the entire data-set to obtain a single set of features or weights by weighting and discriminating the information provided by the user. By taking pairwise constraints into account, we propose a semi-supervised fuzzy clustering algorithm with feature discrimination (SFFD) incorporating a fully adaptive distance function. Experiments on several standard benchmark data sets demonstrate the effectiveness of the proposed method.
منابع مشابه
Leaf classification using multiple feature analysis based on semi-supervised clustering
Multiple features such as the margin, the shape and the texture of plant leaves are of great importance for classification of plant species, as they are often regarded as the unique features to identify plants. In this paper, we study the performance of a recently proposed semi-supervised fuzzy clustering algorithm with feature discrimination for leaf classification, based on features generated...
متن کاملEnhancement of fuzzy clustering by mechanisms of partial supervision
Semi-supervised (or partial) fuzzy clustering plays an important and unique role in discovering hidden structure in data realized in presence of a certain quite limited fraction of labeled patterns. The objective of this study is to investigate and quantify the effect of various distance functions (distances) on the performance of the clustering mechanisms. The underlying goal of endowing the c...
متن کاملFuzzy Clustering with Pairwise Constraints for Knowledge-Driven Image Categorization
The identification of categories in image databases usually relies on clustering algorithms that only exploit the feature-based similarities between images. The addition of semantic information should help improving the results of the categorization process. Pairwise constraints between some images are easy to provide, even when the user has a very incomplete prior knowledge of the image catego...
متن کاملImprove Semi-Supervised Fuzzy C-means Clustering Based On Feature Weighting
Semi-supervised learning is somewhere between unsupervised and supervised learning. In fact, most semi-supervised learning strategies are based on extending either unsupervised or supervised learning to include additional information typical of the other learning paradigm. Constraint fuzzy c-means a novel semi-supervised fuzzy c-means algorithm proposed by Li et al [1]. Constraint FCM like FCM ...
متن کاملDocument Clustering Based On Semi-Supervised Term Clustering
The study is conducted to propose a multi-step feature (term) selection process and in semi-supervised fashion, provide initial centers for term clusters. Then utilize the fuzzy c-means (FCM) clustering algorithm for clustering terms. Finally assign each of documents to closest associated term clusters. While most text clustering algorithms directly use documents for clustering, we propose to f...
متن کامل